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COMPRESSION OF DATA WITH SYNTACTIC STRUCTURE 

1 . Introduction to the General Topic 

5 The coding of source files has important technical 
applications in the area of transmission and storage of 
information. The desire to have a low data rate in the 
transmission of information, and a small space 
requirement in the storage of information, has given 
10 rise to the demand for a form of information coding that 
is as brief as possible, i.e. one with little or, 
ideally, no redundancy. 

2. State of the Art 

15 

2.1. Forms of Redundancy 

Redundancy in streams of source symbols takes two basic 
forms: it can appear as a differential frequency of 
20 occurrence of the elements in the set of source symbols, 
and in the form of statistical dependencies between 
temporally separated source symbols, which usually are 
caused by source-specific formation laws for the 
construction of a text as a sequence of source symbols. 

25 

3 . Disadvantages of Known Methods 

For the compression of messages from sources that suffer 
from redundancy and emit source symbols that are not 
30 uniformly distributed but are statistically independent, 
a procedure according to Huffman [1] is known. Being 
specially intended for this type of source, the 
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procedure enables maximal compression only under the 
extremely favorable condition that the statistical 
properties of the source are known or can be reliably 
estimated . 
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Whereas estimation of the probabilities of occurrence of 
individual source symbols, which is necessary for 
Jjj Huffman coding, can be done with sufficient accuracy 

tjj even in the case of moderately long texts, the 

estimation of statistical dependencies between 
temporally separate source symbols as a rule requires 
extensive observation of the source, which goes far 
beyond what is feasible both technically (storage 
capacity) and in terms of observation time. For this 
15 reason the known or similar procedures always involve 
making assumptions about the underlying formation law or 
crucial characteristics thereof, and then of course are 
suitable only for sources with this formation law. This 
qualification also applies to the Lempel-Ziv (LZ) 
20 procedure for data compression [2], which has been known 
since the end of the 1970' s. This procedure is suitable 
for the compression of arbitrary linear symbol 
sequences; that is, it can be applied with no knowledge 
of the statistical properties of the information source. 
25 The main areas of application today are the compression 
of pictorial data (GIF file format) and arbitrary files 
(archiving programs gnuzip, pkzip) . In contrast to 
Huffman coding, this procedure utilizes statistical 
dependencies between temporally successive symbols, by 
taking into account repeatedly occurring subsequences of 
symbols (strings) . Hence LZ compression is suitable only 
for sources, the formation law of which leads to the 
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frequent occurrence of particular strings. The 
redundancy introduced into a text by following 
grammatical rules during its construction is not 
accessible to the known compression procedures because 
of the recursive structure of grammatical rules, since 
these procedures merely employ symbol frequencies or 
repetitive chains of alphanumeric characters for 
compression . 



10 4. Definition of the Object of the Invention 

m 

K 4 The object of the present invention is to provide a 

p method and apparatus for implementing the method that 

ff enable a text consisting of a stream of alphanumeric 

H 15 characters, and is constructed according to grammatical 

f% rules, to be encoded in such a way as to remove the 

redundancy produced as a result of the limitation of all 
possible character sequences by the grammar. 

2 0 4.1 Summary of the Invention 

According to a first aspect of the present invention 
there is provided a method for the compressed 
transmission and/or storage of a text represented by 

25 digital data, the structure of which is defined by a 
grammar, wherein 

prior to storage and/or transmission the text is 
converted into a linear sequence of symbols that specify 
the successive application of grammatical rules to form 

30 the text, and 

to decode the text after the storage and/or 
transmission, for every symbol thus produced by syntax- 
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directed coding (SDC) the grammatical rule corresponding 
to that particular SDC symbol is performed and 
particular output data are generated by this rule. 

5 According to a second aspect of the present invention 
there is provided an apparatus for the compressed 
transmission and/or storage of a text represented by 
digital data, the structure of which is defined by a 

grammar, comprising 

'13 . , 

l,s, • 10 a scanner to transform the text comprising a 

sequence of readable characters into a sequence of 

\l terminal symbols, 

a parser to find the grammatical rules, the 

^ successive application of which was originally used to 

I*i h 

H> 15 generate the sequence of terminal symbols, 

a mapper, which unambiguously associates syntax- 
directed symbols with the rules identified by the parser 
and outputs these symbols in a fixed sequence, and 
a decoder comprising 
20 (a) a stack machine which, according to the 

uppermost symbol in the stack memory and any adjacent 
SDC symbol, outputs the already fixed terminal symbol, 
or deposits the sequence of terminal and/or non-terminal 
symbols associated with the current symbol into the 
25 stack memory, and 

(b) a lexicon that replaces the terminal symbols by 
chains of readable alphanumeric characters. 



According to a third aspect of the present invention 
there is provided an encoder comprising 

a scanner to transform a program, present in the 
source text or in a form derived from the source text by 
a preprocessor, into a sequence of terminal symbols, 

a parser to find the grammatical rules, the 
successive application of which was originally used to 
generate the sequence of terminal symbols, 

a mapper, which unambiguously associates syntax- 
directed symbols with the rules identified by the parser 
and outputs these symbols in a fixed sequence, 
and a decoder comprising 

a stack machine which, according to the uppermost 
symbol in the stack memory and any associated SDC 
symbol, outputs the already fixed terminal symbol, or 
deposits the sequence of terminal and/or non-terminal 
symbols associated with the current symbol into the 
stack memory, and 

a code generator that generates from the sequence 
of terminal symbols executable machine code, or 
intermediate code to be executed on a virtual machine. 

5 . Advantages of the Solution in Accordance with the 
Invention 

With the solution in accordance with the invention, 
enabling data reduction in dependence on the 
programming-language syntax employed, it now becomes 
possible in an especially advantageous manner to remove, 
asymptotically almost completely, redundancy produced in 
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texts during their construction because of the need to 
follow certain grammatical rules. 



The invention has proved this by analytical calculation 
of the capacity of its data stream, encoded in a syntax- 
directed manner, using the example of a language with 
characteristics typical of programming languages 
(mathematical expressions, if-then-else construct, 



J=M etc.). In this process it was shown that an appreciable 

jo, 10 amount of the redundancy that conventional methods 

£ (Lempel-Ziv, Huffman) are designed to address has not 

%l previously been eliminated by these methods. 



The new method of syntax-directed coding, in contrast, 
15 is capable of reducing the amount of data by an 
additional ca . one-half in comparison, for instance, to 
a result obtained with the Lempel-Ziv compression 
procedure . 



20 Thus the method in accordance with the invention, 
together with the apparatus specified for its 
implementation, is predestined for all applications that 
involve the transmission or storage of syntactically 
structured texts. Examples include the transmission of 

25 Java applets in the Internet or an intranet, the 
transmission or storage of Postscript files, the 
transmission and storage of MPEG 4 -encoded video data, 
and the use of higher protocols in communication. 
Especially for the transmission of programs, such as 

30 Java applets, syntax-directed coding (SDC) is 
particularly suitable because the parser is a component 
of both a compiler and an SDC coder, so that SDC can be 
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organically integrated into the data flow from the 
writing to the running of the program. 

6 . Mode of operation of the invention 

5 

6 . 1 Syntactically structured source 

The invention takes as a point of departure a formal 
language represented by a set of chains of alphanumeric 

10 characters. The text is put together according to a 
grammar assigned to the language. A grammar is both the 
mathematical system used to define a formal language, 
and a set of rules to determine the syntactic validity 
of a specific sentence. Only context-free grammars are 

15 considered here, because higher programming languages 
(C, C++, Java, etc.) are described almost exclusively by 
context-free grammars. The following definitions apply: 

Definition 1 A context-free grammar is a 4-tuple G = 
20 (N, T,P,S) , where 

N is the set of non-terminal symbols , 
T is the set of terminal symbols , and 
P is the relation N x (N U T) *. 

S is a special non-terminal symbol, also called 
25 start symbol. 

Definition 2 An element (A, /?) e P is called production 
and is abbreviated A —> J3. 



30 



A production (or derivation) is thus a replacement rule 
for a non-terminal symbol A, such that this non-terminal 
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symbol is replaced by a string (chain) /? composed of 
(non-terminal and) terminal symbols. 

When there exist several productions {A, pi) , {A, p 2 ) , 
5 . . . , (A, p n ) € P, for this we write: 

A -> p! 
I P2 

I fin 

10 

Definition 3 The set P A0 £ P of all possible derivations 
for a non-terminal symbol Ao 

Pao = { (A,fi) € P : A = Aq) 
is also called the set of the alternatives for A 0 . The 
15 decision to use a specific production (A 0 ,Pk) e P A or (k = 
1,2, n) is called selection. 

Example 1 All arithmetic expressions in the variables a 
with the operations + and * as well as arbitrarily 
20 nested brackets (,) are a formal language generated by 
the grammar 

G = {{E,T,F} f {a,+,*, (, ) } ,P,E) . 
P consists of the following productions: 

E —> E+ T 

25 I T 

T -> T*F 

I F 
F -> (E) 
I a 
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a * a + (a * a + a) * a, for example, is a valid sentence. 
In generating a sentence, further strings of non- 
terminal and, where appropriate, terminal symbols are 
derived by application of suitable productions to the 
5 non-terminal symbols until ultimately the string 
consists only of terminal symbols. In the example given 
above, the valid sentence consists only of the terminal 
symbols a, +, * f ( , ) . 



10 6.2 Principle of Syntax-Directed Coding 

Information always flows into a text being generated 
when during its construction decisions are made. 
Decisions are to be made when a particular one of 
15 several possible source symbols is to be chosen or, in 
the case of a syntactically structured source, out of 
several applicable productions A —> A —> (5 2f ■ ■ . 

precisely one production actually is applied (selection, 
cf . Definition 3) . 
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SDC encodes, instead of the string of terminal symbols, 
the selections in the order in which they were made 
during the construction of this string (the text) . 



25 Definition 4 E = {o lf a 2 , a n } is called SDC 

alphabet , with 



n = max { I P A \ } 
A 



30 All the elements in each set of alternative P a are 
numbered (arbitrarily) with SDC symbols <7i g E: 
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Definition 5 z A : P A — » £ is an (arbitrary) injective 
mapping . 

The concrete definition of z A can in general be 
5 dependent on P A . 

As an illustration of the invention the following 
exemplary embodiments are described in detail and shown 
in Figures 1 to 6, as follows. 

10 

6 . 3 Brief Description of the Drawings 

Figure 1 is a block diagram of an SDC transmission 
system 

15 Figure 2 shows alternatives of the grammar G and 

selections to which SDC symbols have been attributed 

Figure 3 is a parse tree of the sentence a * a + (a * 

a + a) * a with a resulting linear SDC symbol sequence 

1212 2 21211212 222222 
20 Figure 4 is a flow diagram of the stack machine; 

Figure 5 is an example of an SDC transmission 
system with receiver-side source-text output 

Figure 6 is an example of an SDC transmission 
system with executable Java applet or application as 
25 output 

The functioning of a transmission system with SDC is 
illustrated in Figure 1. In principle transmission and 
storage are equivalent with respect to the way the 
30 coding works. 
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The source delivers a syntactically structured stream of 
symbols (source program) . An elementary component of the 
SDC encoder is a parser, the task of which is to 
retrieve from the initially linear sequence of source 
5 symbols the grammatical structure of the sentence (the 
source program) . This grammatical structure is 
represented in general by a parse tree, which in turn 
serves as input to the encoder. The encoder encodes the 
parse tree to generate a linear sequence of SDC symbols, 

10 such that at each moment the current SDC symbol is 
output and transmitted in dependence on the current 
selection at that moment. The SDC decoder first 
reconstructs the parse tree from the linear sequence of 
SDC symbols and then, by traversing the parse tree, 

15 reproduces the sequence of terminal symbols sent out 
from the source. 



6 . 4 Encoding 



20 The elementary task of the parser comprises the 
reconstruction of the selections made by the source 
during construction of the sentence. A parsing step thus 
amounts to determining the specific production {A, /?) e 
P A that the source had selected for constructing the 

25 text at this point. As stated in Definition 5, the 
encoder assigns to the selection thus _ determined the 
associated SDC symbol a k = z A ( (A, J3) ) and attributes a k 
to the corresponding node in the parse tree. 
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By traversing 
sequence of SDC 
examples given 



the parse tree, ultimately the 
symbols is generated and output, 
here, a depth-first algorithm is 



linear 
In the 
always 
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used for this purpose. The process of encoding will now 
be explained in greater detail with reference to an 
exemplary parse tree and the underlying grammar from 
Example 1. 

5 

Example 2 The context-free grammar of Example 1 is. 
assumed. All alternatives, i.e. all possibilities for 
deriving each non-terminal symbol, are shown graphically 
in Figure 2. Because the greatest set of alternatives 
10 comprises precisely two elements , the SDC alphabet Z = 
{1, 2} suffices. To each selection of all quantities of 
alternatives an SDC symbol is attributed. For example, 
the selection of the specific production T — » T * F is 
assigned the SDC symbol 1. 

15 

A valid sentence in this grammar is thus, for example: 
a * a + (a * a + a) * a. The parse tree for this sentence 
and the generation of the SDC symbol sequence by depth- 
first traversing are shown in Figure 3. 

20 

6 . 5 SDC with arithmetic encoding 

The method that has just been described is efficient 
only if 

25 1 . I P J = const V A, and 

2. fii)\A) = const V A, /3±, 

where P( (Al fi±)\a) is the conditional probability of 
making the selection that generates /?i from the set of 
30 alternatives P A . 
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This deficiency can be alleviated by arithmetic encoding 
[3] of the SDC alphabet S as stated in Claims 6 and 7. 
The probability distributions of the SDC symbols used 
for arithmetic encoding should be adapted in an adapter 
5 according to Claim 10 in which case the adaptation must 
be carried out separately for each set of alternatives. 
Instead of the linear sequence of SDC symbols, the bit 
stream generated by the arithmetic encoder is 
transmitted . 

10 

6 . 6 Decoding 



Decoding is the inverse of the encoding process- The 
decoder formats the SDC code back into the original 
15 sentence (strings of terminal symbols) . It operates 
according to the procedure described in Claims 1, 3 and 
4, 5 and is implemented by a stack machine [4] according 
to Claims 8 and 9. 



20 A flow diagram for a method according to these claims in 
shown in Figure 4 . 



To explain operation of the stack machine the following 
abbreviations are employed: 

25 

f} k String of - terminal and non-terminal- symbols 
a SDC symbol 

V Terminal or non-terminal symbol 
S Start symbol (non-terminal symbol) 
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The decoding thus proceeds in the following steps (the 
step numbers corresponding to the reference numerals 
shown in Figure 4 : - 



1=. 



5 1. The development of a valid sentence according to 
the grammar (a program) begins with the start 
symbol S. The start symbol S is placed on the 

.P stack. 

. »«& 

ill 2. The uppermost symbol V is read from the stack. 

i£ 10 3. Check whether V is a non-terminal symbol. 

h h 4 . From the linear input stream of SDC symbols the 

p 

%1 next SDC symbol a is read. 

5. In case V is a non-terminal symbol, the derivation 
with V is further developed. By way of a a 

15 selection (V r f3 k ) = z" 1 (a) is made for V from the set 

of alternatives of V — > J3i\ J3 2 \ ... l/? n - In accordance 
with the specific selection, V is then replaced by 
the string J3 k . 

6. The string /3 k of symbols (terminal and/or non- 
20 terminal symbols) is placed on the stack. 

7. If V is a terminal symbol, it is output. 

8. Check whether the stack is empty. 

9. The stack machine terminates when the stack is 
empty. 
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The strings of terminal symbols output successively, 
when linked together, finally yield the sentence 
originally encoded from the source. 
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7. Exemplary Embodiment: Preferred Form of Application 

The two exemplary embodiments shown in Figures 5 and 6 
differ merely in the form in which data are sent to the 
receiver, namely as an output of source text 
corresponding to the method according to Claim 3 or as a 
direct output of an executable Java applet or an 
executable Java application, corresponding to the method 
according to Claim 4. 



A description in greater details of these elements and 
'Si their operation in the embodiments shown in Figures 5 

;,L a and 6 is as follows. 

tli. 

%\* 

15 1. Java source 

A Java source 1 corresponds to an ASCII text based 
on the Java grammar, which consists of Java program 
text plus comments, blank spaces etc. 



20 2. SDC encoder 

The SDC encoder 2 is composed of a compiler front 
end, comprising a scanner 3, a parser 4 and a 
symbol-table memory 6, together with a parse-tree 
processor 5, an adapter 8 and modules for 

25 compression and concatenation of the generated 

data. 

3. Scanner 

The scanner 3 reads the source text, removes the 
30 text sites that are unimportant for the program 

such as documentation, comments, blank spaces etc. 
and sends (terminal) symbols to the parser 4. 
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4 . Parser 

The parser 4 interprets the (terminal) symbols from 
the scanner 3, tests whether the symbol sequences 
5 agree with the grammatical rules for Java and sets 

up the specific parse tree for this program. 

5. Parse-tree processor 

During the initialization of parse-tree processing 
10 all nodes of the parse tree are assigned SDC 

symbols specified for each type of node. The 
subsequent traversing of all nodes of the parse 
tree causes an SDC symbol to be output at each 
crossing of a node. These SDC symbols are sent to 
15 both the adapter 8 and the arithmetic coder 10. In 

addition, the parse-tree processor 5 transmits the 
current node type to the adapter 8 . 

6. Symbol-table memory 

20 The symbol-table memory 6 contains the identifiers 

of the Java program text extracted by the parser 4. 

7 . Zip-compression device 

The content of the symbol table 6 is compressed by 
25 a method according to Lempel-Ziv-Welch [2] . 

8. Adapter (transmitter-side) 

During the process of initialization the adapter 8 
enters initial probability distributions for each 
30 node type into the probability-distribution table 

9. During processing of the SDC symbols received 
from the parse-tree processor 5, and with reference 



to the current node type, at each step the 
probability distribution of all SDC symbols of the 
current node type is adapted and entered into the 
probability-distribution table 9. The adapter 8 
simultaneously makes available to the arithmetic 
encoder 10 the probability distribution associated 
with the SDC symbol currently being processed. 

Table of the probability distributions of the SDC 
symbols 

For each node type, the table 9 contains a 
probability distribution of SDC symbols that is 
peculiar to that node type. 

Arithmetic encoder 

The arithmetic encoder 3 receives from the parse- 
tree processor 5 the next SDC symbol to be encoded 
and encodes it with reference to the probability 
distribution made available by the adapter 8. The 
output of the arithmetic encoder 3 is a binary data 
stream. 

Data concatenation 

The binary data stream from the arithmetic encoder 
10 and the symbol table 6 compressed by the 
compressor 7 are combined to form a data packet. 

HTTP server 

On the HTTP server 12 the data packet created in 
the data concatenation 11 is deposited and made 
available to be called up from a mass storage 
device . 



Intranet /Internet 



HTTP client (Internet browser) 

The browser 14 has the task of downloading the data 
packet from an HTTP server 12 by way of the 
intranet/Internet 13 and initiating the process of 
decoding by the SDC decoder 15. 

SDC decoder 

The SDC decoder 15 comprises substantially of the 
arithmetic decoder 17 and the stack machine 22. The 
result of its operation is the sequence of 
(terminal) symbols that correspond to the original 
Java source code 1 (cf. 3 and 4). 

Data extractor 

Here the binary data stream from the arithmetic 
encoder 10 and the symbol table compressed by the 
zip compressor 7 are extracted from the common data 
packet and sent on, the former to the arithmetic 
decoder 17 and the latter to the zip-decompressor 
18 . 

Arithmetic decoder 

The arithmetic decoder 3 adaptively decodes the 
binary data stream produced by data extraction 16, 
taking into account the probability distributions 
associated with the SDC symbols that have been made 
available by the receiver-side adapter 20, and at 
each decoding step transmits an SDC symbol to the 
stack machine 22. 
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18 . Zip-decompressor 

The symbol table from the coder 2, compressed by 
the Lempel-Ziv method 7 and received from the data 
extractor 16, is unpacked and entered into the 
5 symbol-table memory of the decoder 19. 

19. Symbol-table memory of the decoder 

20. Adapter (receiver-side) 

10 The initialization of the receiver-side adapter 20 

is analogous to that of the transmitter-side 
adapter 8. While decoding proceeds, at each 
decoding step the adapter receives an SDC symbol, 
adapts the probability distribution of the SDC 

15 symbol of the current node type and enters the 

adapted probability distributions into the 
probability-distribution table 21. At the same 
time, the receiver-side adapter 20 supplies to the 
arithmetic decoder 17 the probability distribution 

20 that is valid for the next decoding step. The 

selection of this distribution is determined by the 
node type determined by the stack machine 22 and 
expected in the next decoding step. 

25 21. Table of probability distributions of the SDC 
symbols 

For each node type the table contains an individual 
probability distribution of the SDC symbols of this 
node type. 
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Stack machine 

The initialization of the stack machine 22, the 
processing of the stack memory 23 and the output of 
terminal symbols is represented in Figure 4 as a 
flow diagram. In addition the stack machine 22 
transmits the following data to the adapter 20: 

(a) the current node type, for adaptation of the 
probability distributions of the SDC symbols of 
this node type; and 

(b) the next node type, the SDC-symbol probability 
distribution of which the adapter 20 must supply to 
the arithmetic decoder 17 in the next decoding 
step . 

Stack memory 

Stack memory 23 for the stack machine 22. 
Java sink 

Retrieved Java program text without comments, 
superfluous blank spaces etc. 

Code generator 

The code generator 25 corresponds to a compiler 
back end and generates byte code for the JVM 2 6 in 
dependence on the sequence of (terminal) symbols 
provided by the stack machine 22. 

JVM 

The virtual machine for Java programs (Java Virtual 
Machine) . 
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Key to reference characters 
Reference characters in Figure 4 

fi k String of terminal and non-terminal symbols 
cr SDC symbol 

V Terminal or non-terminal symbol 
S Start symbol (non-terminal symbol) 

1. The development of a valid sentence according to 
the grammar (a program) begins with the start 
symbol S. The start symbol S is placed on "the 
stack . 

2. The uppermost symbol V is read from the stack. 

3. Check whether V is a non-terminal symbol. 

4. From the linear input stream of SDC symbols the 
next SDC symbol cr is read. 
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5. If V is a non-terminal symbol, the derivation is 
continued with V. By means of a a selection (V,/3 k ) = 
z' 1 (a) from the set of alternatives of V -> J3 2 \ J3 2 \ ... 
\p n is determined for V. According to the particular 

5 selection made, V is replaced by the string fi k . 

6. The string /3 k of symbols (terminal and/or non- 
terminal symbols) is placed on the stack. 

7. If V is a terminal symbol, it is output. 

8. Test whether the stack is empty. 

10 9. The stack machine terminates when the stack is 
empty. 

Reference characters in Figures 5 and 6 

1 . Java source 
15 2. SDC encoder 

3 . Scanner 

4 . Parser 

5. Parse-tree processor 

6. Symbol-table memory 

20 7. Zip-compression device 

8. Transmitter-side adapter 

9. Table of probability distributions of the SDC 
symbols 

10. Arithmetic encoder 
25 11. Data concatenation 

12. HTTP server 

13 . Intranet /Internet 

14. HTTP client (Internet browser) 

15. SDC decoder 

30 16. Data extractor 

17. Arithmetic decoder 
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18. Zip-decompressor 

19. Symbol-table memory of the decoder 

20. Adapter (receiver side) 

21. Table of probability distributions of the SDC 
5 symbols 

22. Stack machine 

23. Stack memory 

24 . Java sink 

25. Code generator 
10 2 6. JVM 



